Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Free, publicly-accessible full text available July 16, 2026
-
Enhanced Zn anode kinetics and reversibility are achieved at a high ZUR by guiding Zn2+plating underlying the SnO1.17interphase with a regulated (101) orientation, surpassing those achieved by inducing Zn(002) plating overlying the interphase.more » « lessFree, publicly-accessible full text available April 15, 2026
-
The growth rate of the GPU memory capacity has not been able to keep up with that of the size of large language models (LLMs), hindering the model training process. In particular, activations—the intermediate tensors produced during forward propagation and reused in backward propagation—dominate the GPU memory use. This leads to high training overheads such as expensive weight update costs due to the small micro-batch size. To address this challenge, we propose SSDTrain, an adaptive activation offloading framework to high-capacity NVMe SSDs. SSDTrain reduces GPU memory usage without impacting performance by fully overlapping data transfers with computation. SSDTrain is compatible with popular deep learning frameworks like PyTorch, Megatron, and DeepSpeed, and it employs techniques such as tensor deduplication and forwarding to further enhance efficiency. We extensively experimented with popular LLMs like GPT, BERT, and T5. Results demonstrate that SSDTrain reduces 47% of the activation peak memory usage. At the same time, SSDTrain perfectly overlaps the I/O with the computation and incurs negligible overhead. Compared with keeping activations in GPU memory and layerwise full recomputation, SSDTrain achieves the best memory savings with negligible throughput loss. We further analyze how the reduced activation memory use may be leveraged to increase throughput by increasing micro-batch size and reducing pipeline parallelism bubbles.more » « lessFree, publicly-accessible full text available June 22, 2026
-
The overarching goal herein is to identify the factors dominating the performance of a‐IGZO‐based memristors. Despite the highest on/off ratio, greater than 104with a preferred minimal set/reset bias achieved from a‐IGZO‐based memristors, it is observed that the switching performance and stability/reliability of the devices is significantly dominated by theVO··density and metallization material, depending on their reactivity with IGZO. As the first governing factor, ensuring optimalVO··concentration in the switching layer IGZO (VO··/OOxratio 24.3% in this study) is crucial to obtain the tractable formation and rupture of conduction filament. Neither higher nor lowerVO··density than the optimized results in detrimental reliability issues, which may be ascribed to an uncontrollable filament in an abundant vacancy environment or a weak conducting path, respectively. As the second governing mechanism determining the memristor performance and reliability, it is suggested that metallization materials need to be carefully selected based on the thermodynamic redox potential and interfacial stability of the metallization material with IGZO. Metallization materials with larger reduction potential and interfacial stability are found to yield higher switching on/off ratio and greater device performance reliability.more » « less
An official website of the United States government
